Optimal discrimination of quantum states with a fixed rate of inconclusive outcomes 
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In this paper we present the solution to the problem of optimally discriminating among quantum 
states, i.e., identifying the states with maximum probability of success when a certain fixed rate of 
inconclusive answers is allowed. By varying the inconclusive rate, the scheme optimally interpolates 
between Unambiguous and Minimum Error discrimination, the two standard approaches to quantum 
state discrimination. We introduce a very general method that enables us to obtain the solution in 
a wide range of cases and give a complete characterization of the minimum discrimination error as a 
function of the rate of inconclusive answers. A critical value of this rate is identified that coincides 
with the minimum failure probability in the cases where unambiguous discrimination is possible and 
provides a natural generalization of it when states cannot be unambiguously discriminated. The 
method is illustrated on two explicit examples: discrimination of two pure states with arbitrary 
prior probabilities and discrimination of trine states. 
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State discrimination has long been recognized to play 
a central role in quantum information and quantum com- 
puting. In these fields the information is encoded in the 
state of quantum systems, thus, one often needs to iden- 
tify in which of N known states {pi}fLi one such sys- 
tem was prepared. If the possible states are mutually 
orthogonal this is an easy task: we just set up detectors 
along these orthogonal directions and determine which 
one clicks (assuming perfect detectors). However, if the 
states are not mutually orthogonal the problem is highly 
nontrivial and optimization with respect to some reason- 
able criteria leads to complex strategies often involving 
generalized measurements. Finding such optimal strate- 
gies is the subject of state discrimination. 

The two fundamental state discrimination strategies 
are discrimination with minimum-error (ME) and unam- 
biguous discrimination (UD). In ME, every time a sys- 
tem is given and a measurement is performed on it a 
conclusion must be drawn about its state. Accordingly, 
the measurement is described by a TV-element positive 
operator valued measure (POVM) n = {rij}^, where 
each element represents a conclusive outcome. Errors 
are permitted and in the optimal strategy the probabil- 
ity of making an error is minimized In UD, no er- 
rors are tolerated but at the expense of permitting an 
inconclusive measurement outcome, represented by the 
positive operator no. Hence, the corresponding POVM 
is n = {nj}^-Q. When Ho clicks we do not learn any- 
thing about the state of the system and in the optimal 
strategy the probability of the inconclusive outcome is 
minimized [2] . It has been recognized that states can be 
discriminated unambiguously only if they are linearly in- 
dependent Q . Discrimination with maximum confidence 
(MC) can be applied to states that are not necessarily in- 
dependent and for linearly independent states it reduces 



to unambiguous discrimination 0, [B[ , so the MC scheme 
can be regarded as a generalized UD strategy. 

It is clear that UD (or MC for linearly dependent 
states) and ME are the two extremes of a more general 
scheme that can be approached by relaxing the condi- 
tions at either end. That is, we may reduce the opti- 
mal error rate in the ME approach by allowing certain 
fixed rate Q of inconclusive outcomes [fiHsj]; or, starting 
from UD, allow for some fixed rate P e of errors to oc- 
LCH12I] and hence reduce Q. The first approach yields 



the minimal error rate as a function of the given inconclu- 
sive rate P™ m (Q), while the second yields the minimum 
Q as a function of the given error rate Q min (P e ). Since 
the two expressions are the result of optimizations with 
different conditions, it is not immediately obvious, yet 
true, that one is the inverse of the other Notice 
that these general scenarios encompass many practical 
situations, where resources are scarce and one can only 
afford a limited rate of inconclusive outcomes, or where 
the error rate must be kept below certain level but need 
not be strictly vanishing. 

The function P™ m (Q) has never been solved in full 
generality. Analytical solutions have been provided for 
two pure states with equal occurrence probabilities, oth- 
erwise only useful bounds were established [6|-|9|. The 
second approach has recently been solved for the case 
of two pure states with arbitrary occurrence probabili- 
ties [l2j, but it is not obvious whether the method can 
be extended to other cases involving, e.g., mixed states, 
more than two states, etc. Furthermore the connection 
with the first approach remained unnoticed, and so re- 
mains the connection with the MC scheme. 

Here we state some fundamental features of the Fixed 
Rate of Inconclusive Outcome (FRIO) scheme, in partic- 
ular of the function P™ ln (Q), and show its links to the 



2 



other schemes discussed above. We thus give a unified 
and quite complete picture of quantum state discrimi- 
nation. Perhaps most important, we also present a very 
general method to obtain P™ n (Q). This in turn provides 
also the solution, Q mm (P e ), to the second approach. 

The idea is to first introduce a suitable transforma- 
tion which formally eliminates the inconclusive part of 
the problem and reduces it to a ME problem. Since the 
optimal solution to the ME problem is known for many 
special cases, we can immediately adopt it as our starting 
point. This formal solution still depends on free parame- 
ters of the transformation. In the next step we carry out 
a second optimization with respect to these free param- 
eters which then yields the complete solution. 

For clarity's sake, we will introduce FRIO discrimina- 
tion on the case of two states, p\ and P2, with general a 
priori probabilities 771 and 772 such that 771+772 = 1- The 
generalization to more than two states is straightforward. 
We introduce a three element POVM II = {n , ITi, n 2 }, 
with 111 + II2 + ITo = I, where IIx(2) identifies p\<2), 
while IIo corresponds to the inconclusive outcome. The 
average success, error and inconclusive probabilities are: 



P s = trfapilli) 
P e = trfapilla) 
Q = tr(pn ), 



tr (772/O2II2) 
tr (/72P2IT1) 



(1) 

(2) 
(3) 



where p = 771 pi +7/2/O2, and we assume a fixed Q. Clearly 
we have P s + P e + Q = 1 . The optimal strategy minimizes 
P e under the constraint that Q is fixed, yielding P e mm (<3). 

It follows from the above definitions that P e mm (Q) is 
a convex function. To show this, let us assume that 
n' (II") is the optimal POVM for a rate Q' (Q") of 
inconclusive outcomes. Then, the mixed strategy that 
consists in performing the measurement IT' (II") with 
probability p (p = 1 — p), i.e., the measurement strat- 
egy given by the POVM pH' +J5II", has error probabil- 
ity pP e min (<3') +pPf m {Q") for a rate Q =pQ> + pQ" of 
inconclusive outcomes. Since pYl+pH' is not necessarily 
optimal for Q, we have the convexity inequality, 

Pf n {Q= P Q' + pQ")< P Pf n {Q') + pP e min (Q"). (4) 

Convexity implies the following useful properties [la ]: 
i) P min (Q) is continuous in (0,1]; ii) P min (Q) is differ- 
entiable in (0, 1) except, at most, at countably many 
points; iii) Left and right derivatives exist, they sat- 
isfy (PJ nin )'(g_) < (P e min )'(Q+), and are monotoni- 
cally non-decreasing functions of Q; iv) In addition, 
since P e min (Q) > and P e min (l) = 0, we have 
that (P e min )'(Q±) < and P e min (Q) is non-increasing. 

Because of these properties, there must exist a critical 
rate Q c such that for Q c < Q < 1 the right deriva- 
tive (PJ mn )'(Q+) is constant and takes its maximum 
value -a = (P e min )'(Q c+ ). Hence, 



Note that P e min (Q)/(l - Q) is the error probability con- 
ditioned on obtaining a conclusive answer. Eq. ([5]) states 
that this conditioned probability becomes a constant for 
inconclusive rates larger than Q c . The quantity a = I— a 
can be interpreted as a confidence C. Indeed, for sym- 
metric states, such as those in our second example be- 
low, a is the maximum confidence and Q c coincides with 
the minimum rate of inconclusive outcome in the MC 
scheme: Q c = Q^ c . Other links with the MC scheme 
will be given in [lj]. Note that when the states can be 
unambiguously discriminated, as in the case of two pure 
sates, we have a = (C = a = 1); Q c is equal to the 
optimal inconclusive (or failure) probability in the UD 
scheme: Q c = Q UD ; and P e min (Q) = for Q c < Q < 1. 

In order to obtain optimal FRIO measurement strate- 
gies, it suffices to find optimal POVMs in the region 
of Q where P e mm (Q) is strictly convex. Outside this re- 
gion, the proof given above shows that the best measure- 
ments will be trivial convex combinations of those opti- 
mal POVMs, i.e., mixed strategies. This provides an im- 
portant simplification to our optimization problem (see 
examples below) through the following theorem: At val- 
ues of Q such that P™ m (Q) is strictly convex (i.e., where 
optimal strategies are pure), the element IIo °f the opti- 
mal POVM necessarily has a zero eigenvalue. 

This is so because if all the eigenvalues of IIo were non- 
zero, it could be written as IIo = pir o + pi f° r some val- 
ues of p, < p < 1, and some non- negative operator n o . 
Note that II = pH' +pll", where IT = {Yl^p- 1 !!^)} 
and n" = {1, 0, 0} are proper POVMs (the latter is the 
trivial strategy with no conclusive outcomes). Thus, 
II would define a mixed, rather than a pure, strategy 
and P™ m (<2) would not be strictly convex. 

We next show how to transform the problem of finding 
the optimal FRIO strategy to a minimum error problem 
with no inconclusive outcome. The starting point is to 
write ITi + n 2 = I — ITo = fi. Multiplying both sides of 
this equation by fi -1 / 2 we have ITi + IT2 = I, where 
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rr 1 / 2 ^ rr 1 / 2 >o. 



(6) 



So {fli, n 2 } is a POVM. Note that fT 1/2 exists unless n 
has a unit eigenvalue, in which case we address the prob- 
lem differently (see later) . Defining the normalized trans- 
formed states pi and a priori probabilities % as 



Pi 



tr (Qpi) 



Q ' 



(7) 



where Q = 1 — Q, the success and error probabilities 
read P s(e) = QP s(e ), where 



P e = tr(y7ipin 2 ) +tr(772/52iTi), 



(8) 



P e min (Q) = a(l - Q), Q C <Q< 1. 



(5) 



and P s = 1 - P e . Equation © and fii + n 2 = I de- 
fine a ME discrimination problem for the transformed 
states and priors given in Eq. ([7]). Further minimization 
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over the choice of IIo, such that tr/TIo = Q, gives the 
desired result P e min (Q) = minQ P e ME . The optimal solu- 
tion to this ME discrimination problem is well known [l| : 
P e ME = (1- || r72/5 2 - fjtpt ||i)/2, where || -||i is the trace 
norm. An analogous formula does not exist for more 
than two states. However, explicit solutions to the ME 
discrimination of N states {pi}^ =1 are known in some 
particular cases, e.g., symmetric states. Then one can 
carry out the last minimization over IIo. 

We next illustrate the method on the example of two 
pure states, p\ = |V'i)(V'i| an d Pi = \ip2) (ip2\ with general 
a priori probabilities 771 and 772. Since pi — \ipi)(ipi\ are 
also pure states, we can use the more explicit expression 



1 - v 1 -47717)21(^11^2) I 2 



(9) 



Since two pure states can be unambiguously discrimi- 
nated, the critical probability Q c coincides with the op- 
timal failure probability Q UD which reads [T3| 



Qc = < 



771 + 772 cos 2 9, if 771 < 
m + Vi cos2 if 771 > 



COS" 



= „(Q 
'II J 



1 + cos 2 9 

1 - J r ) 



1 + cos 2 e 



(10) 



, 2^/771772" cos 6 = Qq, if r){' < m < 



where |(V'i|V'2)| = cos 9 is the overlap of the states. 

We now set out to find the optimal pure strategy, i.e., 
the optimal POVM that will minimize P e for a fixed 
Q in the interval < Q < Q c . As the Hilbert space 
spanned by two pure states is two-dimensional and the 
optimal IIo has a zero eigenvalue, it is effectively a posi- 
tive rank one operator. We denote the positive eigenvalue 
by £, < £ < 1, the eigenstate belonging to £ by |0) and 
the orthogonal state by |1). In this basis IIo = £|0)(0|, 
and Q = £|0}(0| + |1)(1|, where £ = If we also write 

the input states in this basis, \ipi) = Ci\0) + Si\l), where 
Ci = cos 9i, Si = sin 8i, the transformed states and priors 
can be trivially obtained from Eq. ([7]) , and after obvious 
simplifications we can write 



ME 



Q- \ Q 2 - 477i77 2 (cos6» - £cic 2 ) 



(11) 



where we used that 9\ — 62 = 9- It follows from ^ that 

Q 



vi4 + mc 2 2 ' 



(12) 



Hence Eq. (|lip depends only on one parameter, say 6\, 
which determines the orientation of Ho relative to that 
of the two pure states. The minimization over Ho 
(equivalently, over 6\) simplifies considerably using 
Eq. (TT2")) and defining Ci^ 2 (rjic\ + 77 2 c 2 )~ 1//2 = cos<y9, 
and C2 772^ 2 (t7ic 2 + ^c^)^ 1 / 2 = sin ip. The resulting ex- 
pression is minimum for if — 7r/4, yielding 



P 



min / max 
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e/s =^{Qt\/Q 2 -(Qo-Q) }, Q<Qo, (13) 



where Qo was introduced in the third line of (flQ)) . This is 
the optimal error/success rate for an intermediate range 
of the prior probabilities. One can invert P" un (Q) to ob- 
tain Q and, in turn, P s max as function of P e , P s max (F e ) = 
[Pe /2 + (1 - Qo) 1/2 ] 2 , in agreement with [ll|. 

For the validity of these results, £ < 1 must hold. 
This condition determines the range of priors for which 
Eq. (fT3|) is valid. The definitions of cos <p and sin ip 
after Eq. (TT2")) give y/rj 2 C2 — y/v^i for ip = tt/4. 
Some straightforward algebra leads to 771c 2 = 772 c! = 
771772 sin 2 9/(1 - Qq), and Eq. (fT2]) yields £ = (1 - 
Qo)Q/(2t7i772 sin 2 9). Setting £ = 1 defines a threshold, 



Q th = 2771772 sin 2 9/(1 -Qo). 



(14) 



Hence £ < 1 if Q < Qth and £ = 1 if Q > Qth- 

In Fig. Q] we plot Q c and Qth vs. 7/1 together for a fixed 
overlap, cos 9 — 0.5 (9 = 7r/3). The two curves inter- 
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FIG. 1: Q c (solid line) and Qth (dashed line) vs. 771 for 
9 — 7r/3. The area of interest lies under the solid line. The 
regions I, II and III are defined in Eq. (1101) . 



sect at ?/i = 77^ and 771 = rj[ r \ the same points as in 
Eq. pU|) . The interval < 771 < 1 is thus divided into 
three regions. In regions I and III, we have Qth < Qc 
and the solution (| 13[) is valid for < Q < Qth only. In 



Region II, 77^ < 771 < 77^, we have Q c = Qo < Qth and 
the solution (|T5]) is valid for the entire < Q < Q c range. 

In the shaded parts of regions I and II one has Qth < 
Q < Q c and, necessarily, £ = 1. Hence, n = |0)(0| and 
il = |1) (1| are projectors. Therefore, fi -1 / 2 does not exist 
in these areas and the case needs special consideration. 

The calculation of the error probability is most easily 
performed by realizing that Hi and n2 become degen- 
erate, both must be proportional to Ud = |1)(1|. The 
three-element POVM becomes a standard two element 
projective measurement, {n^ = |l)(l|,n = |0)(0|}. We 
identify a click in with p\ (p2) if 771 > 772 (772 > 771), so 
Pe(s) = msl, P s (e) = Visl, with Q = 1-P e -P s . These 
equations completely determine the solution. There is 
nothing to optimize here, so we drop the superscript min 
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in what follows. 8\ — 82 — 8 immediately gives Q(P e ) as 



Q = l-P e -rhtJJ—coaO±Jl — — sinfl) . (15) 

W^l) V %(!) J 

Inverting this equation gives P e {Q) but the resulting ex- 
pression is not particularly insightful and we will not give 
it here. Note that for P e — (UD limit) one has Q = Q Cl 
given by the first and second lines in Eq. (|10|) . and for 
Q = Qth, P e reduces to (fP3"j) . as it should. 

Let us now briefly discuss a second example with lin- 
early dependent states, which is interesting because such 
states cannot be unambiguously discriminated. Consider 
the trine qubit states \ipk) = cos^lO) + e~^ k smO |1), 
< 8 < 7r/4, with equal prior probabilities r/k = 1/3, 
k = 1, 2, 3. Note that 28 is the polar angle on the Bloch 
sphere. The set of signal states is covariant with re- 
spect to the abelian group of unitaries {I, u,u 2 }, where 
u = |0)(0| + e~3 ZL |l)(l|, and this implies that n can 
be chosen diagonal in the basis {|0),|1)}. Moreover, 
since an optimal pure strategy requires that n has a 
zero eigenvalue, we must have IIo = £|0)(0| (the other 
possible choice, IIo = £|1)(1|, turns out not to be opti- 
mal). Using Eq. ([7]) one easily obtains \ipk) — cos0 |0) + 
eTr fe sin(?|l), with cosfl = £ 1 / 2 (^cos 2 + sin 2 e)- 1 / 2 cos(9 
and fjk = r/k — 1/3, i.e., the transformed states are 
themselves trine states with polar angle 28. Eq. ([3]) 
gives Q = £ cos 2 8, where we have used that the av- 
eraged density matrix p for the trine states is p — 
(1/3) Ylk=i IV'fc)(V'fe|i hence £ is determined, and we sim- 
ply have P™ ln {Q) = QP^, as no minimization over IIo 
is possible. After some algebra we can rewrite cos 8 
as cos 8 = Q-i/^cos 2 ^- Q) 1 / 2 , and sinfl = Q-^sinfl. 
Substituting in P e ME = (2 - sin20)/3 (see, e.g., 0) we 
obtain 

^e ni "(Q) = \ (p ~ smS^cos? 8 -Q^j , Q < Q c . (16) 

To calculate Q c we use Eq. (J5J) for Q = Q c . For the 
case at hand it reads P min (Q c ) = (Q c - l){P™ in )' {Q c ), 
where we have used that P™ m (Q), defined in Eq. p^|) . is 
differentiable. The solution is Q c = cos 28, which in turn 
yields a = 1/3. Then, P e min (Q) is given by Eq. © with 
these particular values of a, and Q c . The latter and a = 
2 /3 are both in agreement with the values of the optimal 
failure probability <5 MC and the maximum confidence C, 
respectively, for the trine states in Q. These results are 
illustrated in Fig. [5] They exemplify the link between 
MC and FRIO schemes. 

To summarize, we introduced a very general transfor- 
mation in Eqs. ([6J and (O that turns every problem with 
fixed inconclusive rate into an equivalent ME problem. 
When the solution of the resulting ME problem is known 
one can optimize it over the free parameters of the trans- 
formation. In some special cases, including symmetric 




FIG. 2: Trine-state minimum error P™ m (Q) (solid line) and 
minimum error conditioned on obtaining a conclusive out- 
come P e min (Q)/Q (dashed-dotted line). For Q > Q c , the two 
lines become straight, with a slope of a = 1 — C = 1/3 resp. 0. 
For the plot 6 = tt/10. 



states [3, 17 1 or two mixed states whose density ma- 
trices are diagonal in the Jordan basis 18j, this can be 
done analytically. We have identified a critical value Q c 
of inconclusive rate that generalizes the notion of failure 
probability used in UD to other cases where UD cannot 
be applied, such as discrimination of linearly dependent 
states or full rank mixed states. We note that related 
work has been done independently by Ulrike Herzog [3] . 
We will present further details in a separate publication 
14j . The method we presented here is very powerful and 
can be applied in many other cases, including quantum 
state estimation with post processing (2pj ] . 
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